Face Pose Estimation in Uncontrolled Environments
نویسندگان
چکیده
Automatic estimation of head pose from a face image is a sub-problem of human face analysis with widespread applications such as gaze direction detection, human computer interaction or video teleconferencing. It can also be integrated in a multi-view face detection and recognition system. Current methods on face pose estimation from a 2D image can be divided into two groups. The first group : geometric shape or template based methods (e.g. [3]) use a set of landmarks such as the relative position of the eyes, mouth, etc. or a template such as an Active Shape Model (ASM) to estimate pose. The second group: manifold learning methods (e.g. [1]) use linear/non-linear embedding methods to learn a lower dimensional space in which they estimate pose. One limitation of current methods is that most of them estimate pose in a limited range and treat pose estimation as a classification problem by assigning the face to one of many discrete poses [3]. However pose estimation is truly a regression problem. Another drawback of current methods is that they have mainly been tested on faces taken in controlled environments i.e. with solid background and small or no variation in illumination and expression. In this paper we propose a probabilistic framework for continuous pose estimation. We use a general image representation that does not rely on locating facial features. This representation is inspired by recent successes of patch-based methods which have shown to be highly effective for other areas of computer vision such as texture generation [2]. We use this representation in a generative model for automatic estimation of head pose in “real world” images ranging from −90◦ to 90◦. Our approach breaks the test image into a non-overlapping regular grid of patches. Each is treated separately and provides independent information about the true pose. There is also a predefined library of object instances. The library can be considered as a palette from which image patches can be taken. We exploit the relationship between the patches in the test image and the patches in the library to estimate the face pose. In inference (see Figure 2), the test image patch is approximated by a patch from the library L (Fig. 2b,c). The particular library patch chosen can be thought of as having a different affinity with each pose. These affinities were learned during a training period and are embodied in a set of parameters W (Fig.2d). The relative affinity of the chosen library patch for each pose is used to determine a posterior probability over pose (Fig. 2e).
منابع مشابه
Face recognition in uncontrolled environments
This thesis concerns face recognition in uncontrolled environments in which the images used for training and test are collected from the real world instead of laboratories. Compared with controlled environments, images from uncontrolled environments contain more variation in pose, lighting, expression, occlusion, background, image quality, scale, and makeup. Therefore, face recognition in uncon...
متن کاملRobust 3D Face Recognition
Face recognition in uncontrolled environments is hindered by variations in illumination, pose, expression and occlusions of faces. Many practical face-recognition systems are affected by these variations. One way to increase the robustness to illumination and pose variations is to use 3D facial images. In this paper 3D face-recognition systems are presented. Their structure and operation are de...
متن کاملNosePose: a competitive, landmark-free methodology for head pose estimation in the wild
We perform head pose estimation solely based on the nose region as input, extracted from 2D images in unconstrained environments. Such information is useful for many face analysis applications, such as recognition, reconstruction, alignment, tracking and expression recognition. Using the nose region has advantages over using the whole face; not only it is less likely to be occluded by acesssori...
متن کاملCamera Pose Estimation in Unknown Environments using a Sequence of Wide-Baseline Monocular Images
In this paper, a feature-based technique for the camera pose estimation in a sequence of wide-baseline images has been proposed. Camera pose estimation is an important issue in many computer vision and robotics applications, such as, augmented reality and visual SLAM. The proposed method can track captured images taken by hand-held camera in room-sized workspaces with maximum scene depth of 3-4...
متن کاملReview on Sparse based Face Recognition Methods in Uncontrolled Environment
Face recognition has been found to be the minimally invasive method among the available biometric identification schemes. It aims to identify or verify a person’s identity by matching input face image against the faces in a database. It is one of the important research areas due to its large practical applications. Most of the systems could well handle the images captured under controlled situa...
متن کامل